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' Abstract 

Luca Trevisan [Tj presented an approximation algorithm for Max Cut based on spectral partitioning 
^ , techniques. He proved that the algorithm has an approximation ratio of at least 0.531. We improve this 

bound up to 0.6142. We also define and extend this result for the more general Maximum Colored Cut 
problem. 
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1 Introduction 

Given a graph G and a nonnegative weight function on the edges of G, the maximum cut problem is that 
C/3 , of finding a partition of the vertices in two sets such that the total weight of the edges crossing the cut 

^ ' is maximized. It is easy to find a solution such that the weight of the cut is at least half of the optimum 

(for instance, a random partition cuts half of the total weight of the graph). The algorithm of Goemans 
and Williamson [2], based on semidefinite programming, yields a 0.87856 • • • approximation for this problem. 
Until recently, the only known methods giving a guarantee of more than a half required solving a semidefinite 
program. In [T], Trevisan presented an algorithm based on spectral partitioning with approximation factor 
, bounded away from half. In that paper, it was shown that the algorithm has an approximation ratio of at 

• least 0.531. We present a new analysis that improves that bound up to 0.6142. We also show how to apply 

this algorithm to the slightly more general maximum colored cut problem. 
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O ■ 2 Algorithm idea 

In the Maximum Colored Cut problem (MaxCC), we are given a graph G = {V, E) with E = RU B, where 
^ ■ the ed ges are colored either red or blue, and a nonnegative weight function w : E M+. We want to 

^ I partition V into two sets V- and V+ such that the weight of the red edges that are cut by the partition plus 

the weight of the blue edges that are uncut is maximized. This problem generalizes the Max Cut problem 
(in which B = 0). 

Given a bipartition of V, we say that a red edge is good, if it belongs to the associated cut, and that a 
blue edge is good if it is not cut. Any edge which is not good will be denoted as bad. With this notation, 
the objective of the problem is to find a partition that maximizes the total weight of good edges. 

Luca Trevisan's approach ^ to construct an approximation algorithm for Max Cut is the following: 
Device an algorithm A that partitions V into three sets V+,Vb,VL, where Vq is the set of nodes that are 
still "undecided" between and V- , and use A recursively in the set of undecided nodes until every node 
is decided. For a given tripartition, the edges of the graph induced by V+ U V- are already labeled as good 
or bad. Also, no matter what the recursive partition of the undecided vertices into two pieces L and R is, 
we can always impose that half of the total weight of the edges between V+ U V- and Vq = LU R is good 
(by assigning L to V+ and R to V- or viceversa). This observation suggests that the objective of A should 
be to find a tripartition for which the ratio of the weight of the good edges induced by V+ U V- plus half 
of the total weight crossing from V+ U V- to Vq with respect to the total weight of the edges involved (the 
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ones incident in V+ U VL) is as high as possible. We call this ratio the recoverable ratio of the partition. As 
we will see later in the paper, we can find a partition with high recoverable ratio by using spectral partition 
techniques. 

The algorithm we describe is the same as the one in [I], so we assume familiarity with that paper. The 
previous analysis for the approximation ratio of that algorithm involved upper bounding the number of uncut 
(bad) edges in each iteration via an application of Cauchy-Schwartz inequality. We give a tighter analysis 
by directly lower bounding the number of cut (good) edges. 



3 Some notation 



For a graph G = {V,E) with n vertices, a set of edges F C E, and a set of vertices A C V, we use F[A] 
to denote the set of edges in F with both endpoints inside of A, similarly, for disjoint Ai,A2 C V we use 
F{Ai : A2) to denote the set of edges in F with one cndpoint in Ai and the other in A2. We define the 
indicator vector of a bipartition {V+, VL} of V to be the vector x £ {—1,1}" such that Xi — 1 ii i E V+ 
and Xi — —1 a i £ VL. Similarly we say that x E {—1,0,1}" is the indicator vector of the tripartition 
{V+, Vq, V-} of V if Xi is 1, or -1 whenever i is in V+, Vq or V- respectively. As usual, for a weight function 
w : E R+, and F C E, we use w{F) as a shorthand for X^esB ^if )- ^'^^ ^'^^^ ^^^^ paper, we fix a 
graph G = (V, E) where |y| = n and E = RU B, and a weight function w : E R+. 
The following quadratic formulation gives the value of the maximum colored cut. 

max — Wij{xi — Xj)'^ -\ — Wij{xi + Xj)'^ . (MaxCC) 

Let A/^*'-'^ G R"^" be the matrix associated to the quadratic form involving edge {z, j} for MaxCC: 



x^ M^^'^^x 



w^j{xi-XjY if{«>i}'Si?, 



By letting M = j}e_E -^^ijj MaxCC can be expressed as max^g{_i ^x'^ Mx. It will also be convenient 
to define matrices £)f''^} such that 

x^D^'^^"^x:^w^j{xl+x]), 

and let D — -^^^ D^*'-'^. In particular note that if a; e { — 1, 1}" then x^Dx = 2w{E). It is also easy to 
check that 

M = Deg(i?) - Adj(i?) + Deg(i?) + Adj(i3), 
D = Deg(i?) +Deg(B). 

where Adj(ii') and 'Deg{K) are the weighted adjacency and degree matrices associated to the edge set K. 

In the following lemma, we relate the value of the MaxCC to the highest eigenvalue of a certain positive 
semidefinite matrix: 

Lemma 3.1. If MaxCC > (1 — e)w{E), then there exists a vector x £ R" \ {0} such that 

x^Mx 

>2(l-e). 



x'^Dx 



Proof. Let x be the vector solving uiaxxeK" x'^ Mx/x^ Dx, and x* E {—1,1}" be the incident vector of a 
maximum colored cut. Then 

x^Mx x*^Mx* _ 4MaxCC 

- x*^Dx* ~ 2w{E) - ' 

To find X we can simply find a unit vector y associated to the maximum eigenvalue of the positive semidefinite 
matrix D^^/^MD^^^^, and then set x = D^^^^y. □ 
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In practice, x can not be found exactly, however, it is know that we can find efficiently a vector x such 
that x^ Mx > 2(1 — e — 6)x'^Dx in time inverse proportional to 6 (see the discussion of Lemma 2 in [I]). In 
the next section we will show how to derive a good tripartition, this is, one having a good recoverable ratio, 
using this vector as a starting point. 



4 Finding a good tripartition 

Let z E {—1, 0, 1}" be an indicator vector of a partition Vq, V+, such that V ^ Vq. Define the weights of 
the good, bad, crossing and incident edges associated to the partition as: 



Good(z) 
Bad(z) 
Cross(z) 
Inc(2) 



= w{R{V^ : V+)) + w{B[V^] U B[V+]). 
^ w{B{V- : V+)) + w{R[V-] U R[V+]). 
= w{E{V- UV+ : Vo)). 
^ w{E) - w{E[Vq]). 



Note that Good(z) + Bad(z) + Cross(z) — Inc(2;). The recoverable ratio of z is defined to be (Good(z) + 
Cross(z)/2) divided by Inc(z). In particular, if z is the indicator vector of the optimal MaxCC, the recovery 
ratio is exactly the ratio MaxCC/w{E) = z^Mz/2z^Mz. 

Given a vector x G K" such that the ratio x'^ Mx/x'^ Dx is big, we will be able to find a rounded vector 
z G { — 1,0, 1}" with high recoverable ratio. The following lemma, which can be seen as an improvement of 
Lemma 3 in [1], will be useful for this task. 

Lemma 4.1. Let x E R", with ||a;||cc- = 1- Construct y G {—1, 0, 1}" as follows. Pick t G_r [0, 1] uniformly 
at random and let, for each i, 



Vi = 



-1, 



if Xi > \Jt. 
if Xi < —\fi. 
if \xi\ < Vi. 



Let 





= Pr 








U{i,j) 


= Pr 






\Vi 




= Pr 
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he the probabilities that an edge {i,j} is cut, uncut or crossing the tripartition induced by y. Then for all 
< /? < 1, 



U{i,j)+/3X{t,j)>(3{l~p){x,+x,f, 
C{i,j) + pX{i,j) > - /3)(x, - xj)\ 



(4.1) 
(4.2) 



Proof. Note first that (|4.2p follows if we apply (|4.H) to the vector x' obtained from x by switching the sign 
of Xj. To prove (|4.ip we consider two cases. 

Case 1: If Xi ■ Xj > 0. Assume, w.l.o.g. that \xi\ < \xj\. In this case, U{i,j) is equal to the probability 
that both Xi and x^ are bigger than t, thus U{i, j) — xf. On the other hand, X{i, j) is equal to the probability 
that t is between xf and x'j, this is X{i,j) = {x'j — xf). 

Using the inequality (3{1 - (3){a + h)"^ < (1 - (3)0^ + vahd for a, 6 > and < /3 < 1, we get 



P{l-f3){x,+x,f =f}{l-P){\x^\ + \x^f < {1 - I3)xf + f3x^^ ^ U{t,j) + pX{z,j). 
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Case 2: If Xi -Xj < 0. Assume again, w.l.o.g. that \xi\ < \xj\. In this case it is easy to sec that U{i,j) = 
and that X{i,j) = (xj — x'j). 

Since Xi ■ Xj < 0, we have \xj + Xi\ < \xj — Xi\, and so, using that < < 1 we get 

- P){x, + xjf < P\x, + xj\- \x^ -xj\^ I3(x] - xl) = U{i,j) + pX{i,j). 

□ 

The main result of this paper is given by the foUowing lemma. 

Lemma 4.2. Given a non-zero vector x such that x^ Mx > 2(1 — e) ■ x'^Dx, for some < e < 1/2, we can 
efficiently find a vector z £ {—1, 1,0}" \ {0} such that 

'-l + V4e2-8e + 5 



^ , Cross(0) ^ 
Good(z) + > < 



o(,_^\ Inc(z) ife>eo, 

; ' (4.3) 
Inc(z) if £ < £o- 



[1 + 2^I{T^) 



where en « 0.22815 ... is the unique solution of the equation : = — — ; — — . 

1 + 2V^(T^ 2(1 -e) 

Proof. Without loss of generality, assume that ||a;||oo = 1. Let y be the random vector obtained from 
X as in Lemma 14.11 and < /5 < 1 to be specified later. Then, using the previous lemma and that 
E[2Inc(y) — Cross(?/)] = x'^ Dx, we obtain 

E[Good(y) + /3Cross(y)] = ^ w,,,{C{i,j) + (3X{i,j)) + ^ w,,,{U{i,j)+(3X{t,j)) 

>(3{l-(3) mj{xi-Xj)^ + [3) WiJ{x^+XJf 

= P{1 - (3)x'^Mx > 2(1 - e)P{l ~ (3)x^Dx 
= 2(1 - e)/3(l - f3)E[2liic{y) - Cross(y)]. 

Rearranging terms, we get 

E[Good(?/)] + (/? + 2(1 - e)(3{l - /?)) E[Cross(?/)] > 4(1 - e)/3(l - /3)E[Inc(y)] . (4.4) 

Let < Po := ^ ~ ~ - Se + 5 < 1 ^ ^j^^^ _^ 2(1 - e)(3o{l ~ /3q) = 1/2. We have two ways to 

4(1 ^ £) 2 
proceed now. The first one is to set (3 — (3q and replace it in (|4.4p to obtain: 



E[Good(y)] + iE[Cross(y)] > 1 + Vfe^ + -jE[Inc(y)]. (4.5) 
2 2(1 — e) 

Observe that this bound is valid for all < e < 1/2 but it gets worse as e gets closer to (for every value 
of e, the optimal recovery ratio is at least (1 — e)). 

The second way to select j3 will give us a bound that is tighter when e is close to 0. Recall that 
Good(?/) + Bad(?/) + Cross(y) = Inc(y). Subtracting (|4.4p from this we get 

(1 - 4(1 - £)/3(l - /3))E[Inc(y)] > E[Bad(y)] + (l - /3 - 2(1 - e)/3(l - /?))E[Cross(y)]. (4.6) 

Observe that the coefladent in front of E[Cross(y)], 1 - /? - 2(1 - e)/3(l - ^) = (1 - /3)(1 - 2(1 - e)(3) is 
positive and at most 1/2 whenever /Jq < /? < 1/(2 ~ 2e). Assume that [3 is in that interval and divide the 
equation (jiTB)) by 2(1 - /3)(1 - 2(1 - e)l3) to obtain 

1 - 4(1 - em - P) p^^^^^j ^ E[Bad(y)] ^ lK[Cross(y)] > E[Bad(y)] + iE[Cross(y)]. 



2(l-/3)(l-2(l-e)/?) ' 2(l-/3)(l-2(l-e)/3) 2 ' - ' 2 
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Since Good(?;) + Bad(?;) + Cross(y) — Inc(y), we get 



E[Good(,)] + lE[Cross(,)] > (l - IB:[Inc(.)] ^ ^(T^^|i^|t^^P-(^)]- 

The function := — — — is nonnegative whenever /3 < 1/2 or 1/(2 - 2e) < /3 < 1. 

2(1 - P)(l - 2(1 - e)fj) 
Because of our previous assumption on /3, the only case of interest is when Po < /3 < 1/2. 

A little computation shows that F(/3) has a local maximum in /?* = - — < -, with value 

2 2 y 1 — e 2 

F(/3*) = . Hence, provided that Po < P* , we can set f3 = (3* to obtain 

l + 2^e{l-e) 

E[Good(y)] + iE[Cross(y)] > L ^ E[Inc(y)]. (4.7) 

2 1 + 2^2(1 -e) 

1 —1 + \/4e2 _ _|_ 5 

Let £o w 0.22815 ... be the unique solution of the equation , = ; . It is 

1 + 2V^(T^ 2(1 -e) 

easy to check that (3o < P* if and only if e < Eq. This, together with (|4.5p and (|4.7p . implies that the desired 

inequality, (|4.3p . holds in expectation and therefore, there exists a z G {—1, 0, 1}" satisfying it. In particular, 

we can find z by testing all n thresholds (a;|) for t, and keeping the one with greatest recoverable ratio. 

Furthermore, this vector z is not zero, since for the index i such that \xi\ = 1, we also have \zi\ = 1. □ 

Remark 4.3. Consider the function 



-1 + V4e2 - 8e + 5 

-, if e > £o. 



2(1-8) 

1 



if e < Eq. 



Lemmas 13.11 and 14.21 imply that if G has a bipartition with at least 1 ~ e fraction of the total weight being 
good ('cut'), then we can find a partition into three pieces V-,V+,Vo with recoverable ratio at least /(e). 
Previously, Luca Trevisan proved (analyzing Min Uncut) that the same is true for the function f{e) ~ 1—2^. 
In Figure [1] we can see the relationship between these two bounds. Note that we always have /(e) > /(e). 
Also, it is worth noting that a random bipartition gives a recoverable ratio of 1/2 in expectation. The 
previous guarantee only beats 1/2 when e < 1/16, the new guarantee implies that the algorithm is better 
than a random cut for any e < 1/3. 



5 Main algorithm 

For a given G = {V,E — R(J B) and nonnegative weight function w : E R+, the following algorithm 
returns a bipartition (VF+,VF_) that is a good approximation to MaxCC problem. (In what follows, we 
assume that the vector x given by Lemma l3.1l can be found exactly in order to keep the argument simpler). 

ALG{V, E) 

1. Compute X using Lemma |3. II and determine from it the vector z given by Lemma 14.21 

Good(z) + Cross(z) 12 

2. If — < 1/2, then return a bipartition W-) of V such that the weight of good 

Inc[z) 

edges is at least half of 'w{E) (a random cut suffices). 

3. Otherwise, let (V+,y_,Vb) be the tripartition induced by z. If Vb = return {V+^V-). Else, let 
(W+, iy_) ^ ALG(Vb, £;[yo]), and return the best of {V+ U W+,V- U W-) and (V^ U iy_, U W+). 
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Figure 1: The red curve (top) corresponds to the recoverable ratio guaranteed by f{e), the green one (lowest) 
to the one guaranteed by /(e). 



Theorem 5.1. If MaxCC{G) = (1 - e)w{E), then ALG{V,E) returns a partition {W_ 
y G { — 1, 1}" is the associated indicator vector, 



W+), such that, if 



Good(y) 

w{E) 



> 



max 



i,/(£/r) 1 dr 



with /(e) as in remark \4.3\ 



Proof. First, it is clear that the algorithm terminates since in every recursive call the residual graph consid- 
ered has at least one fewer node. Assume that the algorithm performs T recursive calls. Let Gt = {Vt,Et) 
be the graph at the beginning of the t-th iteration, so that Gi = {V,E). Let also Gt+i to be the empty 
graph and St = w{Et)/w{E) for every t. We observe that 



w{Et) - MaxCC(Gt) ^ w{E) 



w{Et) 



MaxCC(G) e 



6t ■ w{E) 



The previous holds since for the optimal partition defining MaxCC(G), the total weight of the bad edges 
is at least the weight of the bad edges inside the subgraph induced by Et- This quantity is itself at least 
the weight of the bad edges for the partition defining MaxCC(Gt). From this observation we get that 
MaxCC(Gt) > (1 — £ / 5t)w{Et) ■ Recall that a random cut has a recoverable ratio of at least half. Combining 
this with Remark 14.31 we get that inside Et \ £'t+i we are guaranteed a good weight of at least 



max 



f{e/5t) ■ w{Et \ Et+i). 



Using that / is decreasing, and that the sets Et \ Et+i are mutually disjoint, we obtain that the total good 
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weight returned by the algorithm is at least 

i=0 



Good(y ) ^ 
w(E) 



I max (]-J{£/5t)\ dr 
^ / max (^./(e/r)) dr 



1 a 



max [^-,/(£/r)j dr. 

□ 

Corollary 5.2. T/ie partition returned by ALG guarantees a 0.6142. . . approximation for MaxCC. 

Proof. Let F(e) — /J^ max (1/2, /(e/r)) dr. By Theorem 15.11 F{e) is the fraction of the weight guaranteed 
to be good ('cut') by the algorithm. Note that f{x) = 1/2 when x = 1/3. Using this, and the definition of 
/ it is easy to see that 

'1/2, ife>l/3. 



Fie) 







'^1^ , /-^ -l + v/4(£/r)2-8(£/r) + 5 ^ <r <- i 

7:dr + / — — dr, if eo < e < 1/3. 

2 Jse 2(1 - e/r) 

2 73, 2(1 -£/r) 7.,„ l + 2VeWl-e/r) 





with eo « 0.228155 . . ., as in Lemma l4?2l 

The guarantee of approximation of the algorithm is, then, the minimum of the function G(e) = -j^^ 
when £ is in the interval [0, 1/2]. We can check that the function G{e) is convex and has an unique minimum 
at e* w .11089 . . . with value G{e*) « 0.614247 ... □ 

For completeness, we include a closed form of function G and a plot comparing this guarantee with the 
previous guarantee of [1] for different regimes of e. 

5.1 Analytic expression of the guarantee function. 

If MaxCC(G') > w{E) ■ (1 — e), then the algorithm described before gives a G(e)-approximation for the 
MaxCC problem, with G{e) defined by the following expression: 
If £ > 1/3 

G(£) := ^ 



2(l-£)- 



If £o < £ < 1/3, 



G{e) := — • £ - 1 + V4£2 - 8£ + 5 - £ In 



1 + V4£2 - 8£ + 5 



2(l-£) V V 8£ 

V5 / 5-4£+ v/5(4£2-8£ + 5) ' 
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and if £ < eo, 



G{s) := 



1 



2(1 -e) 



3 



e 



£ ( 1 - — ) + 2 + — a/4£2 - 8£o + 5 - £ln 



1 + y/feg - 8£o + 5 



^£ln i^if^I±^MME8£^Ui6£lnf^+^ 
5 \ (ll + 5\/5]£n I ' Ve+ /— -e 



■8£ 



V£o(l-eo) + l-2£o g^y£(l - £) + 1 - 2£) 



£o + A/eo(l - £o) 
Where Eq ~ 0.22815 ... is the unique solution of the equation 



V£ + Vl - e /' 

1 -1 + V4e2 - 8£ + 5 



l + 2v'£(l-£) 



2(1 -e) 




0,1 0.2 0.3 0.4 0.5 



Figure 2: The red curve corresponds to G{e) = the black one bellow correspond to the guarantee H{e) 
given previously in [1 with minimum on 0.531: H{e) = li^j^t^, for e < 1/16 and H{e) = 2{i-e) ' '^'^'^ 
e > 1/16. 
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